Newborn African penguin named after a hot dog

Popular Science

The critically endangered chicks, Oscar and Duffy, were born at a New Jersey aquarium. An aquarium in New Jersey welcomed two new residents, just in time for the holidays. On December 20, staff at Adventure Aquarium in Camden revealed the recent births of Duffy and Oscar, a pair of African penguins and some much-needed good news in light of ongoing conservation concerns. "These milestones are incredibly important for the critically endangered African penguin population, and we couldn't be more proud to play a role in their future," the aquarium, just outside of Philadelphia, Pennsylvania, wrote in a social media post.



Aligning LLMs by Predicting Preferences from User Writing Samples

Aroca-Ouellette, Stéphane, Mackraz, Natalie, Theobald, Barry-John, Metcalf, Katherine

arXiv.org Artificial Intelligence

Accommodating human preferences is essential for creating aligned LLM agents that deliver personalized and effective interactions. Recent work has shown the potential for LLMs acting as writing agents to infer a description of user preferences. Agent alignment then comes from conditioning on the inferred preference description. However, existing methods often produce generic preference descriptions that fail to capture the unique and individualized nature of human preferences. This paper introduces PROSE, a method designed to enhance the precision of preference descriptions inferred from user writing samples. PROSE incorporates two key elements: (1) iterative refinement of inferred preferences, and (2) verification of inferred preferences across multiple user writing samples. We evaluate PROSE with several LLMs (i.e., Qwen2.5 7B and 72B Instruct, GPT-mini, and GPT-4o) on a summarization and an email writing task. We find that PROSE more accurately infers nuanced human preferences, improving the quality of the writing agent's generations over CIPHER (a state-of-the-art method for inferring preferences) by 33%. Lastly, we demonstrate that in-context learning (ICL) and PROSE are complementary methods, and combining them provides up to a 9% improvement over ICL alone.
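The abstract's two key elements, iterative refinement and verification across writing samples, can be sketched as a simple loop. This is a minimal illustration, not the paper's implementation; `llm`, `infer_preferences`, and the prompt wording are all placeholder assumptions.

```python
# Hedged sketch of PROSE's two elements: refine an inferred preference
# description, and verify it against every user writing sample.
# `llm` stands in for a real model call; all names are illustrative.

def infer_preferences(llm, writing_samples, max_rounds=3):
    """Iteratively refine a preference description until it is
    consistent with every writing sample (or rounds run out)."""
    description = llm(
        f"Describe the author's writing preferences:\n{writing_samples[0]}")
    for _ in range(max_rounds):
        # Verification: check the description against each sample.
        failures = [s for s in writing_samples
                    if llm("Does this text follow these preferences?\n"
                           f"Preferences: {description}\nText: {s}") != "yes"]
        if not failures:
            return description  # verified on all samples
        # Refinement: revise using the samples the description missed.
        description = llm(f"Revise these preferences: {description}\n"
                          f"to also fit: {failures}")
    return description
```

The verification step is what pushes the description away from the generic summaries the abstract criticizes: a vague description will fail the per-sample check and trigger another refinement round.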


Tuning-free coreset Markov chain Monte Carlo

Chen, Naitong, Huggins, Jonathan H., Campbell, Trevor

arXiv.org Artificial Intelligence

A Bayesian coreset is a small, weighted subset of a data set that replaces the full data during inference to reduce computational cost. The state-of-the-art coreset construction algorithm, Coreset Markov chain Monte Carlo (Coreset MCMC), uses draws from an adaptive Markov chain targeting the coreset posterior to train the coreset weights via stochastic gradient optimization. However, the quality of the constructed coreset, and thus the quality of its posterior approximation, is sensitive to the stochastic optimization learning rate. In this work, we propose a learning-rate-free stochastic gradient optimization procedure, Hot-start Distance over Gradient (Hot DoG), for training coreset weights in Coreset MCMC without user tuning effort. Empirical results demonstrate that Hot DoG provides higher quality posterior approximations than other learning-rate-free stochastic gradient methods, and performs competitively to optimally-tuned ADAM, for a variety of datasets, models, and coreset sizes. [Figure 1: Relative Coreset MCMC posterior approximation error (average squared coordinate-wise z-score) using ADAM with different learning rates versus the proposed Hot DoG method (with fixed r = 0.001). Median values after 200,000 optimization iterations across 10 trials are used for the relative comparison.]
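To give a feel for the distance-over-gradient family of learning-rate-free methods that Hot DoG builds on, here is a minimal one-dimensional sketch of a DoG-style update: the step size is the maximum distance travelled from the start divided by the root sum of squared gradients, seeded by a small initial radius. This is a generic illustration of the idea, not the paper's Hot DoG algorithm, and `dog_minimize` and its parameters are invented for the example.

```python
import math

def dog_minimize(grad, x0, steps=2000, r_eps=1e-3):
    """Learning-rate-free gradient descent in the distance-over-gradient
    style: step size = (max distance from x0 so far) / sqrt(sum of squared
    gradients), seeded with a small initial movement radius r_eps."""
    x = x0
    max_dist = r_eps      # running max of |x_t - x_0|
    grad_sq_sum = 0.0     # running sum of g_t^2
    for _ in range(steps):
        g = grad(x)
        grad_sq_sum += g * g
        eta = max_dist / math.sqrt(grad_sq_sum + 1e-12)
        x -= eta * g
        max_dist = max(max_dist, abs(x - x0))
    return x

# e.g. minimize (x - 3)^2 without choosing a learning rate
x_star = dog_minimize(lambda x: 2.0 * (x - 3.0), x0=0.0)
```

The step size grows automatically while the iterate is far from the optimum and shrinks as gradients accumulate, which is what removes the learning-rate knob the abstract says Coreset MCMC is sensitive to.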


PREDICT: Preference Reasoning by Evaluating Decomposed preferences Inferred from Candidate Trajectories

Aroca-Ouellette, Stephane, Mackraz, Natalie, Theobald, Barry-John, Metcalf, Katherine

arXiv.org Artificial Intelligence

Accommodating human preferences is essential for creating AI agents that deliver personalized and effective interactions. Recent work has shown the potential for LLMs to infer preferences from user interactions, but these methods often produce broad and generic preferences, failing to capture the unique and individualized nature of human preferences. This paper introduces PREDICT, a method designed to enhance the precision and adaptability of inferring preferences. PREDICT incorporates three key elements: (1) iterative refinement of inferred preferences, (2) decomposition of preferences into constituent components, and (3) validation of preferences across multiple trajectories. We evaluate PREDICT on two distinct environments: a gridworld setting and a new text-domain environment (PLUME).
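The three elements named in the abstract, refinement, decomposition, and cross-trajectory validation, can be sketched as follows. This is a structural illustration only; the component names, prompts, and `predict_preferences` helper are assumptions, not the paper's code.

```python
# Hedged sketch of PREDICT's structure: preferences are decomposed into
# named components, each validated against multiple candidate trajectories
# and refined only when it fails. `llm` stands in for a real model call.

def predict_preferences(llm, trajectories,
                        components=("tone", "length", "format"),
                        max_rounds=3):
    # Decomposition: infer each preference component separately.
    prefs = {c: llm(f"Infer the user's {c} preference from: {trajectories[0]}")
             for c in components}
    for _ in range(max_rounds):
        # Validation: a component must hold across every trajectory.
        invalid = [c for c in components
                   if any(llm(f"Is '{prefs[c]}' consistent with {t}?") != "yes"
                          for t in trajectories)]
        if not invalid:
            break
        for c in invalid:  # Refinement: revise only the failing components.
            prefs[c] = llm(f"Revise the {c} preference '{prefs[c]}' "
                           f"to fit all of: {trajectories}")
    return prefs
```

Decomposing first means a failed check pinpoints which aspect of the inferred preference is wrong, instead of forcing a rewrite of one monolithic description.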


The Dome Is Watching You

The Atlantic - Technology

On a recent Wednesday night in Los Angeles, I was ready to buy a hot dog with my face. I was at the Intuit Dome, a $2 billion entertainment complex that opened earlier this month. Soon, it will be the home of the L.A. Clippers, but I was there to watch Olivia Rodrigo, queen of teen angst, perform a sold-out show. The arena was filled with people wearing purple cowboy hats and the same silver sequin miniskirt, all of us ready to scream-sing for two hours straight. But first, we needed food.


Chain-of-Thought Reasoning Without Prompting

Wang, Xuezhi, Zhou, Denny

arXiv.org Artificial Intelligence

In enhancing the reasoning capabilities of large language models (LLMs), prior research primarily focuses on specific prompting techniques such as few-shot or zero-shot chain-of-thought (CoT) prompting. These methods, while effective, often involve manually intensive prompt engineering. Our study takes a novel approach by asking: Can LLMs reason effectively without prompting? Our findings reveal that, intriguingly, CoT reasoning paths can be elicited from pre-trained LLMs by simply altering the decoding process. Rather than conventional greedy decoding, we investigate the top-k alternative tokens, uncovering that CoT paths are frequently inherent in these sequences. This approach not only bypasses the confounders of prompting but also allows us to assess the LLMs' intrinsic reasoning abilities. Moreover, we observe that the presence of a CoT in the decoding path correlates with a higher confidence in the model's decoded answer. This confidence metric effectively differentiates between CoT and non-CoT paths. Extensive empirical studies on various reasoning benchmarks show that the proposed CoT-decoding substantially outperforms the standard greedy decoding.


Jointly Training Large Autoregressive Multimodal Models

Aiello, Emanuele, Yu, Lili, Nie, Yixin, Aghajanyan, Armen, Oguz, Barlas

arXiv.org Artificial Intelligence

In recent years, advances in the large-scale pretraining of language and text-to-image models have revolutionized the field of machine learning. Yet, integrating these two modalities into a single, robust model capable of generating seamless multimodal outputs remains a significant challenge. To address this gap, we present the Joint Autoregressive Mixture (JAM) framework, a modular approach that systematically fuses existing text and image generation models. We also introduce a specialized, data-efficient instruction-tuning strategy, tailored for mixed-modal generation tasks. Our final instruct-tuned model demonstrates unparalleled performance in generating high-quality multimodal outputs and represents the first model explicitly designed for this purpose. Autoregressive text-to-image models, as exemplified by works such as Yu et al. (2023; 2022), have made remarkable strides in generating highly detailed images, paralleling the achievements of Diffusion Models Nichol et al. (2022); ...


ChatGPT is everywhere. Here's where it came from

MIT Technology Review

ChatGPT is a version of GPT-3, a large language model also developed by OpenAI. Language models are a type of neural network that has been trained on lots and lots of text. Because text is made up of sequences of letters and words of varying lengths, language models require a type of neural network that can make sense of that kind of data. Recurrent neural networks, invented in the 1980s, can handle sequences of words, but they are slow to train and can forget previous words in a sequence. In 1997, computer scientists Sepp Hochreiter and Jürgen Schmidhuber fixed this by inventing LSTM (Long Short-Term Memory) networks, recurrent neural networks with special components that allowed past data in an input sequence to be retained for longer. LSTMs could handle strings of text several hundred words long, but their language skills were limited.
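The "special components" mentioned above are gates. A minimal single-unit sketch shows how they let past data persist: the textbook formulation with a forget gate (added by Gers et al. after the original 1997 design), with illustrative weights, not any production implementation.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def lstm_step(x, h, c, w):
    """One step of a single-unit LSTM. w maps each gate to
    (input weight, recurrent weight, bias)."""
    f = sigmoid(w["f"][0] * x + w["f"][1] * h + w["f"][2])    # forget gate
    i = sigmoid(w["i"][0] * x + w["i"][1] * h + w["i"][2])    # input gate
    o = sigmoid(w["o"][0] * x + w["o"][1] * h + w["o"][2])    # output gate
    g = math.tanh(w["g"][0] * x + w["g"][1] * h + w["g"][2])  # candidate
    c_new = f * c + i * g          # old memory kept by f, new info gated by i
    h_new = o * math.tanh(c_new)   # exposed hidden state
    return h_new, c_new
```

With the forget gate saturated near 1 and the input gate near 0, the cell state `c` passes through steps almost unchanged, which is exactly the long-range retention that plain recurrent networks lacked.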